Gompertz mortality law and scaling behaviour of the Penna model 
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The Penna model is a model of evolutionary ageing through mutation accumulation where tradi- 
tionally time and the age of an organism are treated as discrete variables and an organism's genome 
by a binary bit string. We reformulate the asexual Penna model and show that a universal scale in- 
variance emerges as we increase the number of discrete genome bits to the limit of a continuum. The 
continuum model, introduced by Almeida and Thomas in [Int. J. Mod. Phys. C, 11, 1209 (2000)] can 
be recovered from the discrete model in the limit of infinite bits coupled with a vanishing mutation 
rate per bit. Finally, we show that scale invariant properties may lead to the ubiquitous Gompertz 
Law for mortality rates for early ages, which is generally regarded as being empirical. 

PACS numbers: 87.23.-n, 87.10. +c 



I. INTRODUCTION 

The Penna model was devised in 1995 by T. J. P. Penna 
to model the process of evolutionary ageing through 
mutation accumulation. The idea that natural selection 
would permit behaviour such as ageing is initially baf- 
fling: it would seem that survival of the fittest would 
remove any such detrimental behaviour. Medawar pro- 
posed 0] that certain genes may be age specific in their 
effects; if such genes are harmful and are activated late 
on in the reproductive life of an organism, natural selec- 
tion against them will be much weaker than if they had 
become active earlier in the organism's life. Given the ex- 
istence of such genes, it can be anticipated that harmful 
genetic conditions will become more common as an or- 
ganism ages giving rise to increasing mortality rates with 
age. The Penna model is a means to model the evolu- 
tion of an age-structured population under the influence 
of age-specific harmful mutations 3] . 

Traditionally, mortality rates are known to rise expo- 
nentially for early ages, giving rise to the Gompertz law 
Q of mortality. More recent experiments using much 
larger sample populations have shown that the mortality 
rate for advanced ages is seen to slow substantially giving 
rise to a mortality plateau or peaks H, IE S @ • ^ has 
been shown that a modified Penna model while continu- 
ing to show Gompertz growth in mortality rates at early 
ages can also exhibit a mortality plateau at advanced 
ages @. 

The original Penna model is discrete in nature, with 
time represented by an integer and an organism's genome 
by a bit-string. Each on the bit-string represents a 
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healthy site; each 1 is a harmful mutation which becomes 
active once the organism reaches age x where x is the in- 
dex of the site on the bit-string. Having activated T 
harmful mutations an organism dies. The bit-string is 
taken to be finite in length (usually 32 bits) and each 
newborn organism has a number of mutations M intro- 
duced into the bit-string. These mutations are taken to 
be harmful so can only turn healthy sites into unhealthy 
ones - a mutation on an unhealthy site is ignored. This 
assumption is relaxed in |l(j where a small rate of posi- 
tive mutation is allowed: we confine ourselves here to the 
case of only harmful mutations. 

Scaling behaviour was considered by Malarz |ll| who 
investigated the effects of different bit string lengths on 
the Penna model. Malarz inquired as to whether large 
bit strings were required or whether one could expect, 
after appropriate scaling of other parameters, one would 
get the same results for different genome lengths. Inves- 
tigating the effects of string length through simulation, 
Malarz was unable to find scaling in the Penna model. 

Almeida et al. [l2j later considered a continuous Penna 
model and for certain mutation regimes were able to find 
simple scaling relations. To obtain such scaling the au- 
thors decoupled the string-length and mutation rate so 
that the probability of finding a given number of mu- 
tations in a given string length was given by a poisson 
distribution. They also observed that the Penna model 
is able to sustain a maximum possible lifespan in steady- 
state, which we call / m a X : if the imposed string length is 
greater than l max then it will have no effect on the prop- 
erties of the population; if it is less than Z max then the 
imposed string length will impose a maximum lifespan on 
the population and the distribution will be accordingly 
altered. The authors suggested that the size of timesteps 
in a discrete Penna model may have an effect on scaling 
behaviour but did not investigate the size or nature of 
this effect. 

Brigatti et al. investigated scaling in a sexual 
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Penna model through simulation and suggested that re- 
sults from the continuum model of Almeida et al. 01 
were not readily mapped onto the discrete model em- 
ployed in simulation. Scaling effects in the sexual model 
were also investigated by Laszkiewicz et al. 0] through 
simulation. 

In this paper we extend our previous analytical solu- 
tion of the asexual model [jj, 0] to examine the scaling 
behaviour. We show that the scale invariance emerges as 
we increase the number of discrete genome bits, and that 
the scaling becomes exact in the continuum case, which 
can be regarded as the limit of infinite genome bits cou- 
pled with a vanishing mutation rate per bit. This estab- 
lishes a clear relationship between the distribution, pa- 
rameters and scaling behaviour of the continuum model 
of Almeida et al. and those of the traditional discrete 
model. Finally, we use scale invariance to analytically 
show that at early ages mortality rates grow exponen- 
tially in accordance with the Gompertz law |4| which, 
for the lack of a general proof, is still generally regarded 
as being empirical [l6[ . 



II. A CONTINUOUS PENNA MODEL 

The asexual Penna model can be reformulated [l2L 
so that rather than considering discrete timesteps, time 
is treated as a continuous variable, t. The bit-string of an 
organism is replaced by an axis representing the genome: 
position x on the genome is examined at age x. Harmful 
mutations are then represented by 5 functions along the 
genome. After accumulating T 8 functions an organism 
dies. 

For our analytical solutions, we concern ourselves pri- 
marily with T = 1 as generalizing a T = 1 solution for a 
continuous model will be no more difficult than general- 
izing a discrete T = 1 model, as done previously [jj- In 
the continuum Penna model an organism reproduces at 
a constant rate b and dies at age x where its genome has 
its first harmful mutation (<5 function) at position x. 

An organism can be characterised by its age x and its 
genetic lifespan / (the position on the genome of the first 
harmful mutation). Neither x nor I are constrained to 
be integers. n(x, I) is now a density of organisms so that 
the number of organisms with age and genetic lifespan 
in the range x — > x + dx and Z — >■ Z + dZ is given by 
n(x, l)dxdl. The probability of giving birth in time dt is 
given by bdt, the probability of a mutation being intro- 
duced in length dl is given by /3d/. These definitions are 
consistent with the discrete Penna model where sites can 
be interpreted as infinitessimal lengths of genome and 
timesteps as infinitessimal units of time. The probability 
of no mutations occuring in length dl is 1 — (3dl which is 
e -/3di £ or m fi n itessimal dl. 

Newborn organisms may be produced as unmutated 
copies of organisms with equal genetic lifespan, or as 
mutated copies of naturally longer lived organisms. An 
equation can then be constructed for the production of 



new organisms within the population for the infinitesimal 
time period of t to t' = t + dt 

n(0, Vjfdt dl = bdtdl e~ fM / da; n(x, l) t 
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where subscripts t and t' denote time. At steady-state, 
the subscripts may be dropped, the above equation can 
be simplified and an expression obtained [|J, fl5| for the 
relative sizes of population densities (see Fig.QJ. 
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For a steady-state to exist there must be a longest 
lived sub-population which is self-sustaining, i.e. not re- 
liant on mutated births. No other sub-population can be 
self-sustaining if the population is to remain bounded, 
as shorter-lived organisms can always be created by mu- 
tated copies of longer- lived ones. For the longest-lived 
sub-population to be self-sustaining, each organism must 
produce one perfect copy of itself during its lifetime 
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All other populations, with I < l ma x, gain from mutated 
births of the longest lived, so unmutated birth per indi- 
vidual must, on average, be less than unity 
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These conditions can be combined to give 
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In the discrete Penna model the probability of no mu- 
tation for 1 site or bit is 1 — m where m is the muta- 
tion rate per site. The probability for I sites without 
mutations would be (1 — my. In the limit of m — » 0, 
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and therefore m play the same role as (3 



in the continuous case, where the probability of no mu- 
tations in genome length I is e~^' . Thus we can identify 
the continuum Penna model as the limit of the discrete 
model as the mutation rate per site tends to zero. For 
vanishingly small units of discretization, a discrete model 
becomes a continuous one. A measure of the extent of 
discretization is the size of one of the discrete units di- 
vided by the total size of the system; for the Penna model 
this is j-=— . As the extent of discretization gets smaller, 
/max tends to infinity which implies a vanishing mutation 
rate. Thus, the two limits of mutation rate tending to 
zero, and of increasingly fine grained discretization, are 
identical. 
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FIG. 1: A plot of genetic lifespan distribution for a discrete 
( x ) and continuous Penna model with Z max = 30. 



III. SCALING PROPERTIES 



FIG. 2: A plot of genetic lifespan distribution for an unsealed 
Penna model with Z max = 20 (+), a model with Z max = 200 
scaled down by a factor of 10 (x) and a model with Z max = 
2000 scaled down by a factor of 100 (o). Only comparable 
points have been shown. 



We examine the discrete and continuous Penna mod- 
els in turn to examine how they behave under rescaling. 
Informed by this behaviour we interpret the continuous 
model as the limit of a discrete model with a vanishingly 
small mutation rate. 



A. The discrete Penna model 

The traditional asexual Penna model has one unit of 
discretization for each unit of time. It is possible to 
rescale the discrete Penna model so that each unit of 
time is broken up into several timesteps. This can be 
done by taking a Penna model with a maximum lifes- 
pan of a/ max and rescaling I so that, in the rescaled time 
units, the model has maximum lifespan Z max and a dis- 
tinct timesteps in one unit of time. For example, an 
! max = 30 model could be rescaled to give an l max = 15 
model with two timesteps per unit of time. 

When discussing rescaled Penna models we require 
that the steady-state conditions are invariant under 
rescaling. For a population with Z max = 60, the steady- 
state conditions should be the same regardless of how 
many timesteps one unit of time has been broken up 
into. For steady-state conditions to be invariant under 
rescaling the population with genetic lifespan l max must 
be self-sustaining and all other populations partly depen- 
dent on mutation. The first condition can be written as: 



the model a population can be uniquely identified by the 
maximum genetic lifespan it can maintain. 

For rescaled models to be the same they should give 
the same population sizes at comparable points up to an 
arbitrary scaling factor. If the discrete Penna model is 
scale invariant, it should be possible to rescale a model to 
obtain an unsealed model with shorter Z max . For instance: 
an ^ma X = 30 model with scaled by a factor of 2 will have 
a rescaled maximum lifespan of 15; if the Penna model 
is scale invariant this rescaled model will, at comparable 
points give identical results to an unsealed / max = 15 
model (up to a constant normalisation factor for finite 
size scaling). Where 7130 denotes a model with unsealed 
maximum lifespan 30, we require that n^o^l) / ni^il) is 
constant. In a general case for models to be identical 
after scaling we require that 

"«l n ,Bx( a ') Kfl l„»x(')- 

This can be satisfied, eliminating the constant of propor- 
tionality by 

In the case of a — 2 we require that 
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A model rescaled by a factor a will allow n(/ max — -) to 
exist, where a gives the number of units of discretization 
per time interval. The same conditions, equation (1 lKill . 
apply as before. A population is then identified by the 
largest (unsealed) value of Z max it can sustain. When 
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For a Penna model to have Z max a factor of a greater, 
the mutation rate and birth rate must be a factor of a 
smaller. If the parameters of the model which is rescaled 



steady-state is required to be robust under rescaling of are labelled as I', /3', m! and &'; then scaled and unsealed 
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parameters are related by: 
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Application of these scaling rules; the recursion rela- 
tion between successive sub-populations at steady-state; 
and our condition for scale invariance of the model gives a 
relation, in terms of birth and mutation rate, which must 
be satisfied for the discrete model to be scale invariant. 

For a rescaling by a factor of 2 we require that 
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This equality cannot be satisfied due to the factor of 
e -/3/2 on t ne bottom of the recursion relation. As such, 
the discrete Penna model does not exhibit scale invari- 
ance. In the limit of a vanishing mutation rate: e _/3 
approaches unity, the differences between scaled and un- 
sealed models vanish and the discrete model will become 
scale invariant. Fig. [3 confirms that the scaled results of 
the discrete model do approach a limiting 'master curve'. 
This limit is the same as that which gives the continu- 
ous model, so we expect to find the continuous model to 
be scale invariant. Note, only comparable points have 
been plotted and distributions have been normalized so 
J2i a~ 1 n(l) — 1 where a is the scale factor. 



B. The continuous Penna model 

As in the discrete Penna model, we identify a popu- 
lation by the largest value of / max it can sustain. This 
value is no longer constrained to be an integer and can be 
simply expressed as / max = ^ • Rescaling of a continuous 
Penna model is carried out in much the same way as in 
the discrete case: i max is divided by a scale factor a and 
the new model has a correspondingly reduced maximum 
lifespan. In the continuous model time is not broken into 
distinct timesteps, but is treated as a continuum: as a 
result rescaling will not alter the number of timesteps in 
one unit of time. If a continuous model is to be invariant 
under rescaling by a factor a, through similar reasoning 
as in the discrete case, 
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Upon substitution of the steady-state relation for contin- 
uous Penna model populations, this is satisfied by 
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The mutation rate and birth rate in the rescaled model 
are labelled (3' and b' . If, by rescaling the model by a fac- 
tor a, the maximum lifespans are to be the same then the 
mutation and birth rates must be related by: (3' — (3 /a, 
b' = b/a. After this substitution the continuous model is 
clearly scale invaraint as both sides of the equation give 
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It has been shown that in the limit of a vanishing mu- 
tation rate coupled with an infinite maximum lifespan, 
the discrete model becomes a continuous one. In other 
words, for a vanishing mutation rate, the discrete model 
becomes scale invariant. As the limits of a vanishing 
mutation rate and maximum genetic lifespan tending to 
infinity are equivalent, approximate scale invariance be- 
comes more realistic for discrete Penna models of increas- 
ingly large Z max . 



IV. MORTALITY RATES 

Early age Penna mortality rates display the exponen- 
tial growth predicted by the Gompertz law. Using our 
analytical solution to the simple Penna model we evalu- 
ate the growth exponent 7 where the mortality rate at 
age x is proportional to e JX . Evaluation of the Gompertz 
growth rate in terms of Penna model parameters will fa- 
cilitate the fitting of Penna parameters to real world data. 
Throughout we assume that any model has adopted the 
maximum genetic lifespan allowed by its mutation rate. 

Recall that for the simple discrete Penna model the 
mortality rate is given by 
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Using the steady state recursion relation from the sim- 
ple Penna model, the ratio between successive mortality 
rates can be evaluated analytically 
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FIG. 3: The exponential coefficient of Gompertz growth in 
mortality rate estimated from early age mortality rates (x) 
is plotted against the maximum lifespan the population can 
sustain (l max ). The dashed line gives the birth rate at each 
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To usefully exploit this expression, we consider the limit 
of small x, and small (3 where the Penna model becomes 
scale invariant; numerical evaluation of the summation 
term and predicted scaling behaviour can be used to sim- 
plify equation 1)21) \ . Numerically, we find for x >C l ma x 
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Crucially, if the Penna model exhibits universality as dis- 
cussed earlier, this result remains valid for all values of 
( mal . Therefore, noting the continuous Penna model re- 
sult lmax = 1//3 and b = (3e, in the regime of small 
x <C l-max, a first order expansion of equation (|20|l leads 
to 
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namely the Gompertz law, which states that the mortal- 
ity rate increases exponentially at early ages. Further- 
more, it predicts that the exponential coefficient of the 
Gompertz growth rate is given by b, the birth rate. In 
Fig. |21 we compare this birth rate with the exponential 
Gompertz coefficients, extracted by taking the difference 
between the logs of mortality rates at ages x = 2 and 
x = 1 for each population. 

Our approximation depends on x <C ^ m ax, therefore, 
deviation from Gompertz behaviour at later ages (large 
x) is expected as the numerical approximation, equation 
l|21Jl . breaks down as x increases. Similarly, as shown in 
Fig. |3 for small values of Z max this approximation works 
less well but for larger values of Z max it becomes increas- 
ingly accurate. 

V. CONCLUSION 



We have shown by means of exact analytic solution 
that, in the asexual Penna model, a universal scale in- 
variance emerges as we increase the number of genome 
bits/sites, with the invariance becoming exact in the limit 
of the continuum model. In addition, we have built on 
this result and shown that scale invariance may be em- 
ployed to derive an analytical expression for the Gom- 
pertz law of mortality, which has been generally regarded 
as empirical. 
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